Assessment of Different Workflow Strategies for Annotating Discourse Relations: A Case Study with HDRB
نویسندگان
چکیده
In this paper we present our experiments with different annotation workflows for annotating discourse relations in the Hindi Discourse Relation Bank(HDRB). In view of the growing interest in the development of discourse data-banks based on the PDTB framework and the complexity associated with the discourse annotation, it is important to study and analyze approaches and practices followed in the annotation process. The ultimate goal is to find an optimal balance between accurate description of discourse relations and maximal inter-rater reliability. We address the question of the choice of annotation work-flow for discourse and how it effects the consistency and hence the quality of annotation. We conduct multiple annotation experiments using different work-flow strategies, and evaluate their impact on inter-annotator agreement. Our results show that the choice of annotation work-flow has a significant effect on the annotation load and comprehension of discourse relations for annotators, as reflected in the inter-annotator agreement results.
منابع مشابه
Evaluation of Discourse Relation Annotation in the Hindi Discourse Relation Bank
We describe our experiments on evaluating recently proposed modifications to the discourse relation annotation scheme of the Penn Discourse Treebank (PDTB), in the context of annotating discourse relations in Hindi Discourse Relation Bank (HDRB). While the proposed modifications were driven by the desire to introduce greater conceptual clarity in the PDTB scheme and to facilitate better annotat...
متن کاملExperiments with Annotating Discourse Relations in the Hindi Discourse Relation Bank
In the Hindi Discourse Relation Bank (HDRB) project, we are developing a large corpus annotated with discourse relations, such as causal, temporal, contrastive and conjunctive relations. Adopting the lexically grounded approach of the Penn Discourse Treebank (PDTB), we annotate the argument structure of both explicit and implicit discourse relations, as well as the senses of relations. We descr...
متن کاملConcurrent Discourse Relations
The Penn Discourse Treebank (PDTB) was released to the public in 2008 and remains the largest corpus of manually annotated discourse relations — both relations that are signaled explicitly (e.g., by a coordinating or subordinating conjunction, or by a discourse adverbial or other construction) and ones that otherwise appear implicit. The Penn Discourse TreeBank also diverges from other discours...
متن کاملAnnotating Discourse Relations with the PDTB Annotator
The PDTB Annotator is a tool for annotating and adjudicating discourse relations based on the annotation framework of the Penn Discourse TreeBank (PDTB). This demo describes the benefits of using the PDTB Annotator, gives an overview of the PDTB Framework and discusses the tool’s features, setup requirements and how it can also be used for adjudication.
متن کاملصورتبندی گفتمانیِ ابن جوزی برابر صوفیان در تلبیس ابلیس
Human perceptions of phenomena in the word and influenced the discourse. Approach to critical discourse analysis in creating effective access relations provides good models for the study of language¸ ideologies and power relations between decoding and explained constructive dialogue between the text and the perspective of social, political, and how they deal with competing discourses reveals, i...
متن کامل